Search results for "Ensemble method"

showing 10 items of 10 documents

Comparing Boosting and Bagging for Decision Trees of Rankings

2021

Decision tree learning is among the most popular and most traditional families of machine learning algorithms. While these techniques excel in being quite intuitive and interpretable, they also suffer from instability: small perturbations in the training data may result in big changes in the predictions. The so-called ensemble methods combine the output of multiple trees, which makes the decision more reliable and stable. They have been primarily applied to numeric prediction problems and to classification tasks. In recent years, some attempts to extend the ensemble methods to ordinal data can be found in the literature, but no concrete methodology has been provided for preference…

Ordinal data; Boosting (machine learning); Preference learning; Ensemble methods; Computer science; Decision tree learning; Decision trees; Decision tree; Library and Information Sciences; Machine learning; Ensemble learning; Boosting; Mathematics (miscellaneous); Ranking; Pattern recognition (psychology); Psychology (miscellaneous); Artificial intelligence; Statistics Probability and Uncertainty; Rankings

researchProduct
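
The instability argument in the abstract above (small changes in the training data change a single tree's predictions, while a combination of trees is more stable) can be illustrated with a minimal bagging sketch. This is not the method of the paper, which targets ranking data: it assumes a one-dimensional binary classification problem and uses one-split decision stumps as the base trees. The names `fit_stump` and `fit_bagging` are hypothetical.

```python
import random
from collections import Counter


def fit_stump(xs, ys):
    """Fit a one-split decision stump on 1-D data by minimizing training errors."""
    best = None
    for threshold in sorted(set(xs)):
        left = [y for x, y in zip(xs, ys) if x <= threshold]
        right = [y for x, y in zip(xs, ys) if x > threshold]
        # Each side predicts its majority label; an empty right side reuses the left label.
        left_label = Counter(left).most_common(1)[0][0]
        right_label = Counter(right).most_common(1)[0][0] if right else left_label
        errors = sum(y != left_label for y in left) + sum(y != right_label for y in right)
        if best is None or errors < best[0]:
            best = (errors, threshold, left_label, right_label)
    _, threshold, left_label, right_label = best
    return lambda x: left_label if x <= threshold else right_label


def fit_bagging(xs, ys, n_trees=25, seed=0):
    """Train n_trees stumps on bootstrap resamples and predict by majority vote."""
    rng = random.Random(seed)
    n = len(xs)
    stumps = []
    for _ in range(n_trees):
        # Bootstrap: draw n training indices with replacement.
        idx = [rng.randrange(n) for _ in range(n)]
        stumps.append(fit_stump([xs[i] for i in idx], [ys[i] for i in idx]))

    def predict(x):
        votes = Counter(stump(x) for stump in stumps)
        return votes.most_common(1)[0][0]

    return predict


if __name__ == "__main__":
    xs = [1, 2, 3, 4, 6, 7, 8, 9]
    ys = [0, 0, 0, 0, 1, 1, 1, 1]
    model = fit_bagging(xs, ys)
    print(model(0.5), model(9.5))
```

Each stump sees a different resample, so its split point moves between the largest class-0 value and the smallest class-1 value in that resample; the majority vote averages out this variation.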

A comparison of ensemble algorithms for item-weighted Label Ranking

2023

Label Ranking (LR) is a non-standard supervised classification method with the aim of ranking a finite collection of labels according to a set of predictor variables. Traditional LR models assume indifference among alternatives. However, misassigning the ranking position of a highly relevant label is frequently regarded as more severe than failing to predict a trivial label. Moreover, switching two similar alternatives should be considered less severe than switching two different ones. Therefore, efficient LR classifiers should be able to take into account the similarities and individual weights of the items to be ranked. The contribution of this paper is to formulate and compare flexible i…

Label Ranking; Random Forest; Bagging; Ensemble Method; Boosting

researchProduct

A novel ensemble computational intelligence approach for the spatial prediction of land subsidence susceptibility.

2020

Land subsidence (LS) is a significant problem that can cause loss of life, damage property, and disrupt local economies. The Semnan Plain is an important part of Iran, where LS is a major problem for sustainable development and management. The plain represents the changes occurring in 40% of the country. We introduce a novel-ensemble intelligence approach (called ANN-bagging) that uses bagging as a meta- or ensemble-classifier of an artificial neural network (ANN) to predict LS spatially on the Semnan Plain in Semnan Province, Iran. The ensemble model's goodness-of-fit (to training data) and prediction accuracy (of the validation data) are compared to benchmarks set by ANN-bagging. A total …

Environmental Engineering; 010504 meteorology & atmospheric sciences; Artificial neural network; Ensemble forecasting; Elevation; Computational intelligence; K-fold cross-validation (CV); Land cover; 010501 environmental sciences; 01 natural sciences; Pollution; Random forest; Semnan Plain; Statistics; Drawdown (hydrology); Land-subsidence susceptibility; Environmental Chemistry; Ensemble method; Waste Management and Disposal; Groundwater; Environmental Sciences; 0105 earth and related environmental sciences; Mathematics

researchProduct

Boosting for ranking data: an extension to item weighting

2021

Decision trees are a particularly widespread predictive machine learning technique, used to predict discrete variables (classification) or continuous variables (regression). The algorithms underlying these techniques are intuitive and interpretable, but also unstable. Indeed, to make the classification more reliable, the output of several trees is usually combined. Several approaches have been proposed in the literature for classifying ranking data with decision trees, but none of them accounts for either the importance or the similarity of the individual items in each ranking. The aim of this article is to propose a weighted extension of the method …

boosting; weighted ranking data; ensemble methods; decision trees; Settore SECS-S/01 - Statistica

researchProduct

Ensemble methods for item-weighted label ranking: a comparison

2022

Label Ranking (LR), an emerging non-standard supervised classification problem, aims at training preference models that order a finite set of labels based on a set of predictor features. Traditional LR models regard all labels as equally important. However, in many cases, failing to predict the ranking position of a highly relevant label can be considered more severe than failing to predict a trivial one. Moreover, an efficient LR classifier should be able to take into account the similarity between the items to be ranked. Indeed, swapping two similar elements should be less penalized than swapping two dissimilar ones. The contribution of the present paper is to formulate more flexible item…

Ensemble methods; Ranking data; Label ranking; Settore SECS-S/01 - Statistica

researchProduct

Multi-layer intrusion detection system with ExtraTrees feature selection, extreme learning machine ensemble, and softmax aggregation

2019

Recent advances in intrusion detection systems based on machine learning have indeed outperformed other techniques, but struggle with detecting multiple classes of attacks with high accuracy. We propose a method that works in three stages. First, the ExtraTrees classifier is used to select relevant features for each type of attack individually for each extreme learning machine (ELM). Then, an ensemble of ELMs is used to detect each type of attack separately. Finally, the results of all ELMs are combined using a softmax layer to refine the results and increase the accuracy further. The intuition behind our system is that multi-class classification is quite difficult compared to binary classification. So, we…

Artificial intelligence; lcsh:Computer engineering. Computer hardware; Extreme learning machine; Ensemble methods; Computer science; Binary number; lcsh:TK7885-7895; Feature selection; 02 engineering and technology; Intrusion detection system; lcsh:QA75.5-76.95; Machine learning; 0202 electrical engineering electronic engineering information engineering; VDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550; Multi layer; 020206 networking & telecommunications; Pattern recognition; Computer Science Applications; Binary classification; Signal Processing; Softmax function; 020201 artificial intelligence & image processing; lcsh:Electronic computers. Computer science; Classifier (UML); EURASIP Journal on Information Security

researchProduct
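
The final stage described in the abstract above, combining the outputs of per-attack detectors through a softmax, can be sketched as follows. This is only an illustration under stated assumptions: the paper's softmax layer is presumably trained, whereas this sketch applies an untrained softmax normalization to raw detector scores. The function `aggregate`, the class names, and the score values are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of real-valued scores."""
    shift = max(scores)
    exps = [math.exp(s - shift) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate(detector_scores):
    """Turn one score per binary detector into class probabilities and a decision.

    detector_scores maps a class name to the raw score of the detector
    responsible for that class (higher means more confident).
    """
    labels = list(detector_scores)
    probs = softmax([detector_scores[label] for label in labels])
    best = max(range(len(labels)), key=probs.__getitem__)
    return labels[best], dict(zip(labels, probs))


if __name__ == "__main__":
    # Hypothetical scores from three per-class detectors for one network record.
    label, probs = aggregate({"dos": 2.0, "probe": 0.5, "normal": -1.0})
    print(label, probs)
```

The softmax turns independent binary scores into one probability distribution, so the multi-class decision reduces to picking the most probable class.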

ENSEMBLE METHODS FOR RANKING DATA

2017

Recent years have seen a remarkable flowering of works about the use of decision trees for ranking data. As a matter of fact, decision trees are useful and intuitive, but they are very unstable: small perturbations bring big changes. This is the reason why it could be necessary to use more stable procedures, such as ensemble methods, in order to find which predictors are able to explain the preference structure. In this work ensemble methods such as bagging and random forest are proposed, from both a theoretical and computational point of view, for deriving classification trees when ranking data are observed. The advantages of these procedures are shown through an example on the SUSHI data set.

ranking data; ensemble methods; bagging; random forest; Settore SECS-S/01 - Statistica

researchProduct

Evaluation of Ensemble Machine Learning Methods in Mobile Threat Detection

2017

The rapidly growing use of mobile devices continues to soar, causing a massive increase in cyber security threats. The most pervasive threats include ransomware, banking malware, and premium SMS fraud. Solitary hackers use tailored techniques to avoid detection by traditional antivirus software. The emerging need is to detect these threats with a flow-based network solution. Therefore, we propose and evaluate a network-based model that uses ensemble Machine Learning (ML) methods to identify mobile threats by analyzing the network flows of malware communication. The ensemble ML methods not only protect against over-fitting of the model but also cope with the issues related to the changing be…

Computer science; intrusion detection; 0211 other engineering and technologies; Decision tree; 02 engineering and technology; Computer security; mobile devices; 0202 electrical engineering electronic engineering information engineering; supervised machine learning; Soar; Android (operating system); information security; ta113; 021110 strategic defence & security studies; ta213; mobile threats; ensemble methods; 020206 networking & telecommunications; Flow network; Ensemble learning; anomaly detection; machine learning; Malware; The Internet; Mobile device

researchProduct

Ensemble methods for ranking data with and without position weights

2020

The main goal of this Thesis is to build suitable Ensemble Methods for ranking data with weights assigned to the items' positions, in the cases of rankings with and without ties. The Thesis begins with the definition of a new rank correlation coefficient, able to take into account the importance of items' positions. Inspired by the rank correlation coefficient τ_x, proposed by Emond and Mason (2002) for unweighted rankings, and the weighted Kemeny distance proposed by García-Lapresta and Pérez-Román (2010), this work proposes τ_x^w, a new rank correlation coefficient corresponding to the weighted Kemeny distance. The new coefficient is analyzed analytically and empirically and represents the main…

ranking data; boosting; weighted Kemeny distance; bagging; Settore SECS-S/01 - Statistica; linear mixed model; ensemble method

researchProduct
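
The unweighted coefficient τ_x of Emond and Mason (2002), which the abstract above takes as its starting point, can be sketched as follows. The sketch covers only the unweighted case; the position-weighted coefficient proposed in the Thesis is not reproduced here because the abstract does not give its formula. It assumes the usual score-matrix definition (entry 1 when item i is ranked ahead of or tied with item j, -1 when it is ranked behind, 0 on the diagonal); the function names are hypothetical.

```python
def score_matrix(ranking):
    """Build the tau_x score matrix of a ranking.

    ranking[i] is the rank position of item i (1 = best); equal values are ties.
    Entry (i, j) is 1 if item i is ranked ahead of or tied with item j,
    -1 if it is ranked behind, and 0 on the diagonal.
    """
    n = len(ranking)
    return [
        [0 if i == j else (1 if ranking[i] <= ranking[j] else -1) for j in range(n)]
        for i in range(n)
    ]


def tau_x(r1, r2):
    """Rank correlation tau_x between two rankings of the same n >= 2 items."""
    n = len(r1)
    if n != len(r2) or n < 2:
        raise ValueError("rankings must cover the same items, with at least two items")
    a, b = score_matrix(r1), score_matrix(r2)
    total = sum(a[i][j] * b[i][j] for i in range(n) for j in range(n))
    return total / (n * (n - 1))


if __name__ == "__main__":
    print(tau_x([1, 2, 3], [1, 2, 3]))  # identical rankings
    print(tau_x([1, 2, 3], [3, 2, 1]))  # fully reversed rankings
    print(tau_x([1, 2, 3], [2, 1, 3]))  # one adjacent swap
```

Because ties score 1 rather than 0, a ranking with ties still has correlation 1 with itself, which is the property that distinguishes τ_x from Kendall's τ_b.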

A weighted distance-based approach with boosted decision trees for label ranking

2023

Label Ranking (LR) is an emerging non-standard supervised classification problem with practical applications in different research fields. The Label Ranking task aims at building preference models that learn to order a finite set of labels based on a set of predictor features. One of the most successful approaches to tackling the LR problem consists of using decision tree ensemble models, such as bagging, random forest, and boosting. However, these approaches, coming from the classical unweighted rank correlation measures, are not sensitive to label importance. Nevertheless, in many settings, failing to predict the ranking position of a highly relevant label should be considered more seriou…

Artificial Intelligence; Decision trees; General Engineering; Label ranking; Weighted ranking data; Ensemble method; Boosting; Computer Science Applications; Expert Systems with Applications

researchProduct